tucSage: Grammar Rule Induction for Spoken Dialogue Systems via Probabilistic Candidate Selection
Abstract
We describe the grammar induction system for Spoken Dialogue Systems (SDS) submitted to SemEval’14 Task 2. A statistical model is trained on a rich feature set and used to select candidate rule fragments. The posterior probabilities produced by this fragment selection model are fused with estimates of phrase-level similarity based on lexical and contextual information. The proposed system is portable across domains and languages, and it was experimentally validated on three thematically different domains in two languages.
Similar resources
SAIL-GRS: Grammar Induction for Spoken Dialogue Systems using CF-IRF Rule Similarity
The SAIL-GRS system is based on a widely used approach originating from information retrieval and document indexing, the TF-IDF measure. In this implementation for spoken dialogue system grammar induction, rule constituent frequency (CF) and inverse rule frequency (IRF) measures are used for estimating lexical and semantic similarity of candidate grammar rules to a seed set of rule pattern i...
SemEval-2014 Task 2: Grammar Induction for Spoken Dialogue Systems
In this paper we present SemEval-2014 Task 2 on spoken dialogue grammar induction. The task is to assign a lexical fragment to the appropriate semantic category (grammar rule) in order to construct a grammar for spoken dialogue systems. We describe four subtasks covering two languages, English and Greek, and three speech application domains: travel reservation, tourism and finance. The cla...
On-Line Learning of a Persian Spoken Dialogue System Using Real Training Data
The first spoken dialogue system developed for the Persian language is introduced. This is a ticket reservation system with Persian ASR and NLU modules. The focus of the paper is on learning the dialogue management module. In this work, real on-line training data are used during the learning process. For on-line learning, the effect of the variations of discount factor (g) on the learning speed...
An Integrated Approach to Robust Processing of Situated Spoken Dialogue
Spoken dialogue is notoriously hard to process with standard NLP technologies. Natural spoken dialogue is replete with disfluent, partial, elided or ungrammatical utterances, all of which are difficult to accommodate in a dialogue system. Furthermore, speech recognition is known to be a highly error-prone task, especially for complex, open-ended domains. The combination of these two problems – ...